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Let us begin with a mathematical personality quiz. 


Consider the five 



classes of functions given by polynomials, power series that are entire or 
converge everywhere, power series which converge on the unit disk, power 
series which converge on a disk of some positive radius, and formal power 
series. Each of these classes contains the previous one. For the quiz one 
should arrange them according to preference. 



1 Real and complex numbers 

Let Q, R, and C denote the fields of rational, real, and complex numbers, 
respectively. As usual, a complex number z can be expressed as x \ yi^ 
where x and y are real numbers, and %^ = —1, and we call x, y the real and 
imaginary parts of z and denote them Re z, Im z, respectively. We write Z 
for the integers, and Z_|_ for the set of positive integers. 

Let us recall that the rational numbers are dense in the real numbers 
in the sense that if x, y are real numbers such that x < y, then there is a 
rational number r such that 
(LI) X < r < y. 

Also, for each positive real number x there is a positive integer n such that 
(1.2) nx>l, 

2 



which is the same as saying that 

(1.3) - <x. 

n 

liz — x + yi is a. complex number, y e R, then the complex conjugate 
of z is denoted z and defined to he x — yi. Thus 

(1.4) 2Rez = z + z 
and 

(1.5) 2ilmz = z — z 
for all 2; e C. Furthermore, 

(1.6) z + w — z + w 

and 

(1.7) zHJ — zw 

for any complex numbers z, w. 

If X is a real number, then the absolute value of x is denoted |a;| and is 
defined to be equal to x when x > and to be equal to —x when a; < 0. 
Thus I a; I is always a nonnegative real number which is equal to if and only 
if X = 0. The triangle inequality for the absolute value function states that 

(1.8) \x + y\< \x\ + \y\ 

for all a;, 7/ e R, and this is easy to verify from the definitions. We also have 
that 

(1.9) \xy\^\x\\y\ 
for all x,y 

Uz = x-\-yi is a complex number, with x,y £ R, then the norm or 
modulus of z is denoted 1^1 and defined by 



(1.10) \z\ = ^x^ + y^. 

If z happens to be a real number, then this is the same as the absolute 
value of z as in the preceding paragraph. We can also describe \z\ as the 
nonnegative real number such that 

(1.11) Izl^^zz. 
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Notice that j^j = if and only if z — 0. Also, 

(1.12) \Rez\,\lmz\ <\z\ 

for any complex number z. 

U z, w are complex numbers, then 

(1.13) l^ + wl^ = {z + w){z + W) — zz + zUJ + zw + wW 
and therefore 

(1.14) \z + w\^ = \z\^ + 2Re{zw) + \w\'^ 

< \z\'^ + 2\z\ \w\ + \w\'^ ^ (\z\ + \w\f . 

In other words, 

(1.15) 1^ + w| < + 1^1 
for all complex numbers z, w. We also have that 

(1.16) \zw\ — \z\ \w\ 

for all z,w e C, since the complex conjugate of a product is equal to the 
product of the corresponding complex conjugates. 

The real numbers form a metric space with the standard metric and 
the complex numbers form a metric space with the standard metric 1^; — wj. 
One can view the real numbers as a subspace of the complex numbers, since 
the standard metric on R is the same as the restriction of the standard 
metric on C to R. Notice that the set of rational numbers is dense inside the 
real line in the sense of metric spaces, and that the complex numbers with 
rational real and imaginary parts are dense in the complex plane. 

If {zj}'^i, {wj}'^i are sequences of complex numbers which converge to 
z,w & C, then 

(1.17) lim Zj + Wj — z + w 
and 

(1.18) lim Zj Wj — zw. 

If is a sequence of nonzero complex numbers which converges to 

z E C, z 0, then 

1 1 

1.19 lim — = -. 

j^oo Zj Z 
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For any sequence of complex numbers {-Zjl^^i which converges to a complex 
number z, we have that 

(1.20) lim \zA = \zl 

as one can show using the fact that 

(1.21) ||a| - < |a -6| 

for all complex numbers a, 6, which is a consequence of the triangle inequality. 

Of course the same statements hold for the real numbers as a special case. 
Let {zj}'^^ be a sequence of complex numbers. One can check that 
is a Cauchy sequence if and only if the sequences 

(1.22) {Re^,}~i, {lT^z,}f^, 

of real and imaginary parts of the Zj's are Cauchy sequences as sequences 
of real numbers. If 2; is a complex number, then {z^}^]^ converges to z if 
and only if {Yie Zj}^^, {Im^j}?^^ converge to Re 2;, Imz, respectively, as 
sequences of real numbers. A basic result states that the real line with the 
standard metric is complete as a metric space, which is to say that every 
Cauchy sequence of real numbers converges. It follows that the complex 
numbers are also complete as a metric space. 

There is another notion of completeness for the real numbers, which is 
based on ordering. Suppose that A is a subset of the real line. A real number 
h is said to be an upper hound for A is a <h for all a G A. A real number 
c is said to be a least upper hound or supremum of ^4 if c is an upper bound 
of A and if c < 6 for every upper bound h of A. It is easy to see that the 
supremum of A is unique if it exists. 

As an ordered set, the real numbers are complete in the sense that every 
nonempty set A of real numbers which has an upper bound has a least upper 
bound. One often starts with this and derives completeness in the sense of 
convergence of Cauchy sequences. One can also start with completeness in 
the sense of convergence of Cauchy sequences and derive completeness in the 
sense of ordering. 

A sequence {xj}JLi of real numbers is said to be monotone increasing 
if Xj < Xj-^i for all positive integers j. If there is an upper bound for the 
Xj's, then the sequence {xj}'j?^i converges, and the limit is the supremum 
of the x/s. If one starts with completeness of the real numbers in terms of 
ordering, then this statement follows easily from the definitions, and at any 
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rate the convergence of monotone increasing sequences of real numbers which 
are bounded from above is basically an equivalent form of completeness. 

Notice that every bounded subset of the real or complex numbers is totally 
bounded, which is to say that it can be expressed as the union of finitely many 
subsets of arbitrarily small diameter. Basically this reduces to the fact that 
every interval in the real line can be expressed as the union of finitely many 
subintcrvals of arbtrarily small length. It follows from the completeness of 
the real and complex numbers as metric spaces that a subset of the real or 
complex numbers is compact if and only if it is closed and bounded. 

Let Z^^o be an infinite series with terms aj e C. We say that this series 
converges if the sequence of partial sums si — Z^j=o % converges, in which 

CB^SG ^ Q Q/j IS defined to be lim^^ooS;. Of course the sequence of partial 

sums converges if and only if it is a Cauchy sequence, which is equivalent 
to saying that J2^o aj converges if and only if for each e > there is an 

< e whenever m > I > L. This is known as the 



L > so that 



Cauchy criterion for convergence of J^JLo^^j^ ^^^d it follows that if J^jLo^^j 
converges, then limj^^ aj = 0. If J2jLo % is an infinite series of nonnegative 
real numbers, then the sequence of partial sums is a monotone increasing 
sequence of nonnegative real numbers, and hence converges if and only if the 
sequence of partial sums has an upper bound. 

An infinite series J2'jLo of complex numbers is said to converge absolutely 
if the series J2jLo l^jl converges. Absolute convergence implies ordinary con- 
vergence, because of the Cauchy criterion. A series Y.'jLo cij of complex num- 
bers converges absolutely if and only if Yl'^o % converges for any sequence 
^0,^1, •• ■ of complex numbers such that \9j\ < 1 for all j. If J2JLo<^j 
J2'jLo bj are convergent series of complex numbers and a, (3 are complex num- 
bers, then Y^Y=oi^'^j Pbj) also converges, and if J2jLo(ij, J2jLobj converge 
absolutely, then J^'jLoi'^^^j + /^^i) converges absolutely too. A series which 
converges but does not converge absolutely is said to converge conditionally. 

Let z be a complex number, and consider the series Yl'jLoZ-', where as 
usual is interpreted as being equal to 1 when j = 0, even when z = 0. 
li \z\ > 1, then \z\^ > 1 for all j. When \z\ < 1, it is well-known that 
lim^^oo z^ — 0. For each positive integer n we have that Yl]=o is equal to 
(1 — — z) assuming 2; 7^ 1. It follows that Yf^=oZ^ converges when 

1^1 < 1, and in fact converges absolutely, with the sum equal to 1/(1 — z). 

The Leibniz alternating series test states that if &2, • • • is a monotone 
decreasing sequence of nonnegative real numbers which converges to 0, then 
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the series Y^'^oi—lYbj converges. This can be verified using the Cauchy 
criterion. More generally, suppose that the 6j's have the same property and 
that we have a sequence of complex numbers Cq, Ci, . . . such that the partial 
sums Z)j=o bounded. In this case one can again show that J2jLo 

converges. These results can be used to give examples of series which converge 
but do not converge absolutely. 

If Co, ai, . . ., are complex numbers, then we get the associated power se- 
ries I]jlo%-2"'- ^^^^ scries converges for some particular zq G C, then 
limj^ooaj^o = 0) the sequence of aj Zq^s is bounded. In this case one 
can check that Sj^o '^j converges absolutely for all complex numbers z 
such that 1 2; I < 1 2^0 1, and Y^^QttjZ^ converges absolutely when \z\ < \zo\ if 
Yl'jLo dj Zq converges absolutely. The radius of convergence R of Yl'jLo % , 
< i? < cxD, is characterized by saying that J2'jLo'^j converges absolutely 
when \z\ < R and docs not converge at all when \z\ > R. Any power series 
converges at 0, and for a complex number z with \z\ equal to the radius 
of convergence of the series it may be that the series does not converge, or 
converges conditionally, or converges absolutely. 



2 p-Adic numbers 

Fix a prime number p, which is to say a positive integer p which is divisible 
only by 1 and itself. The p-adic absolute value of a rational number x is 
denoted \x\p and defined as follows. If a; = 0, then \x\p = 0. If a; 7^ 0, then x 
can be expressed as 

(2.1) 

n 

for some integer k and nonzero integers m, n, where neither m nor n is an 
integer multiple of p, and one puts 

(2.2) \x\p = p-'. 

Thus \x\p is small if x has a lot of factors of p, and it is large if x has a lot 
of factors of 1/p. 

Thus \x\p is always a nonnegative real number and \x\p = if and only if 
X = 0. The p-adic absolute value satisfies a stronger version of the triangle 
inequality, called the ultrametric version, which is that 

(2.3) \x + y\p<max{\x\p,\y\p) 
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for all x,y e Q. This is not too difficult to verify, and we also have that 

(2.4) \xy\p= \x\p\y\p 
for all x,y E Q. 

Fix a prime number p. The field of p-adic numbers is denoted Qp. Ba- 
sically Qp completes the rational numbers with respect to the p-adic metric 
in the same way that the real numbers complete the rational numbers with 
respect to the standard metric. To be more precise, the p-adic numbers Qp 
contain a copy of the rational numbers Q. The p-adic absolute value function 
extends to Qp, is also denoted | • |p, and satisfies the properties that \x\p is 
equal to when x = and is equal to p' for some integer / when x ^ 0. 

As on Q, we have that |a; + |/| < max(|a;|p, \y\p) and \xy\p = \x\p \y\p for 
all X, y e Qp. The function \x — y\p defines an ultrametric on Qp, extending 
the p-adic metric on Q. With respect to this ultrametric, Q is a dense subset 
of Qp and Qp is complete. 

Let {xj}'^^ and be sequences of p-adic numbers which converge 

to x,y E Qp, respectively. In this event we have that 

(2.5) lim Xj + yj ^ X + y 

and 

(2.6) lim Xj yj = xy 

just as for real and complex numbers. If Xj ^ for all j and x ^ 0, then 

(2.7) \imxJ^^x-K 

i->oo ■' 

Let Z^j^o S tie an infinite series whose terms are p-adic numbers. Just 
as for series of real or complex numbers, we say that the series converges if 
the sequence of partial sums converges in Qp. If J2'jLo % is an infinite series 
of p-adic numbers which converges and a is a p-adic number, then JZ'jLq ot aj 
also converges. If Z^jlo ^jy Sjlo infinite series of p-adic numbers which 

converge, then J^JLoi^^j + ^j) converges too. 

Because Qp is complete, an infinite series J2jLo dj of p-adic numbers con- 
verges if and only if the sequence of partial sums forms a Cauchy sequence. 
In the p-adic case this is equivalent to 

(2.8) lim aj = 0, 

j— ►oo 



8 



because of the ultrametric property of the p-adic absolute value function. In 
particular, in the p-adic case, if an infinite series J2'jLo % converges, then so 
does I]^o^i% whenever \9j\p < 1 for all j. 

Let Go, ai, . . ., be a sequence of p-adic numbers, and consider the associ- 
ated power series Z^^o^j ^"'^ where again is interpreted as being equal to 
1 when j — for all x. If Z^^o % converges for some particular xq G Qp, 
then 

(2.9) lim aj 4 = 
in Qp, which is to say that 

(2.10) lim |a.L|a;op = 

j— >oo 

as a limit of real numbers. It follows that 

(2.11) lima,x^ = 

j— ►oo 

for all p-adic numbers x such that \x\p < \xo\p. 

Let X be a p-adic number. If x 7^ 1, then for each positive integer n we 
have that I]"=o is equal to (1 — x^^^)/ (l — x). If \x\p < 1, then we get that 
Z^jlo^'^ converges, and that the sum is equal to 1/(1 — x). 

By definition of the p-adic absolute value, if a; e Z, then \x\p < 1. Let x 
be a rational number such that \x\p < 1. This means that we can write x as 
a/n for some positive integer n which is not a multiple of p. More precisely 
we can write x as a/{b + pi), where b, I are integers and 1 < 6 < p. We may 
as well assume that 6 = 1, because otherwise we could multiply both the 
numerator and denominator by a positive integer c such that 6 c is equivalent 
to 1 modulo p. 

Thus we have a rational number x which is expressed as a/(l -|-pZ) for 
some integers a, I. It follows from the earlier discussion that 

00 

(2.12) X = aj2i-Piy, 

j=0 

where the series converges in the p-adic metric. Using the approximation by 
partial sums we get that x is a limit of integers in the p-adic metric, so that 
the set of rational numbers with p-adic absolute value less than or equal to 1 
is the same as the closure of the set of integers in the p-adic metric. Similarly 
the set of p-adic numbers with p-adic absolute value less than or equal to 1 
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is equal to the closure of the set of integ subset of Qp with respect to 

the p-adic metric, and this set is denoted Zp and called the p-adic integers. 

For each positive integer n, if x is an integer, then we can write x as 
b + p'^y, where 6, y are integers and < 6 < p". In other words, not only 
does every integer have p-adic absolute value less than or equal to 1, but we 
can express Z as the union of subsets each with p-adic diameter equal 
to p~^. In fact the same is true of the p-adic integer Zp, by approximation. 
It follows that Z, Zp are totally bounded with respect to the p-adic metric. 
Because Qp is complete with respect to the p-adic metric and Zp is closed, we 
obtain that Zp is a compact subset of Qp with respect to the p-adic metric. 

Similarly, p~' Zp is a compact subset of Qp for each positive integer I. To 
be more precise, p~' Zp consists of the x G Qp of the form p~' y for some 
y G Zp, which is the same as the set of x G Qp such that \x\p < p'. Every 
bounded subset of Qp is contained in p~' Zp for some positive integer /, and 
therefore a subset of Qp is compact if and only if it is closed and bounded. 



Let Z[l/p] denote the set of rational numbers of the form p~' a, where a is 
an integer and / is a nonnegative integer. Clearly Z[l/p] is dense as a subset 
of Q with respect to the standard metric. One can also check that Z[l/p] 
is dense as a subset of Q with respect to the p-adic metric. This reduces to 
the fact that every rational number x with \x\p < 1 can be approximated by 
integers in the p-adic metric. 

Consider the Cartesian product Q x Q, consisting of ordered pairs {xi^X2) 
with Xi,X2 G Q. Let us combine the standard and p-adic metrics on Q into 
a product metric on Q x Q, in which the distance from (xi, 0:2) to y^) in 
Q X Q is defined to be the maximum of \xi — yi\ and \x2 — y2\p- That is, we 
use the standard distance in the first coordinate and the p-adic distance in 
the second coordinate. 

The diagonal embedding of Q into Q x Q sends a; G Q to (a;, x). We can 
use this embedding to map Z[l/p] into Q x Q. 

Suppose that x G Z[l/p]. If \x\p < 1, then x G Z, in which case either 
X = or l^l > 1. In other words, either x = 0, or 

(3.1) max(|a;|, \x\p) > 1. 



3 
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If X, y e Z[l/p], then either x — y, or 

(3.2) max(|a; - |/|, |a; - 7/|p) > 1. 

Thus the image of Z[l/p] in Q x Q is discrete with respect to the product 
metric. Namely, the distance between any two distinct points in the image 
of Z[l/p] in Q X Q is at least 1. 

Moreover, any point in Q x Q is at bounded distance from a point in 
the image of Z[l/p] under the diagonal embedding. Explictly, if (y, w) is any 
element of Q x Q, then there is an x e Z[l/p] such that 

(3.3) \x-y\<l 
and 

(3.4) \x-w\p<l. 

We may as well assume that y.w G Z[l/p], because Z[l/p] is dense in Q with 
respect to both the standard and p-adic metrics. Let us write w as y + a + b, 
where a is an integer and < 6 < 1, and put x = y + b. Then x — y = b and 
X — w — —a have the required properties. 

Now let be a finite set of primes, which we can also enumerate as 
Pi, . . . ,Pn, and let Z^ denote the set of rational numbers of the form 

(3.5) pr'^---p;'"a, 

where a is an integer and are nonnegative integers. Thus Ze is 

dense as a subset of Q with respect to the standard metric |a; — 7/| as well as 
the pi-adic metrics \x — y\p^ for i = 1, . . . ,n. 
Consider the Cartesian product 

(3.6) QxQx---xQ, 

with n + 1 copies of Q. We define the distance between two points in this 

Cartesian product to be the maximum of the standard distance between the 
first coordinates and the pj-adic distance between the {i + l)th coordinate 
when 1 < i < n. We can embed Zg into this Cartesian product using the 
diagonal embedding, which sends a; e Q to an (n+l)-tuple whose coordinates 
are all equal to x. 

If X e Ze and the pj-adic absolute value of x is less than or equal to 1 for 
i — 1, . . . ,n, then x is an integer. As a result, either x — 0, or the standard 
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absolute value of x is greater than or equal to 1. This implies that if we take 
two distinct elements of and consider their embeddings into the Cartesian 
product of n+ 1 copies of Q, then the distance between the two points in the 
Cartesian product is greater than or equal to 1. Thus 7^e becomes discrete 
in the Cartesian product. 

Suppose that {y,Wi,... ,Wn) is an element of the Cartesian product of 
n + 1 copies of Q. We would like to show that there is a point in the image 
of Z^; under the diagonal embedding whose distance to {y, Wi, . . . , u;„) is less 
than or equal to n. Specifically, let us check that there is an a; G Zg such 
that 

(3.7) 1^^ ~ y| < 

and 

(3.8) \x - Wi\p. < 1 

for each i, 1 < i < n. 

We may as well assume that y & Tie, since is dense in Q with respect 
to the standard metric. For i = 1, . . . . nwc may assume that Wi—y G Z[l /pi], 
since Z[l/pi] is dense in Q with respect to the p^-adic metric. Thus we can 
write Wi as y + Gi + bi, where is an integer and 6j G Z[l/pj] satisfies 

< 6j < 1. If we take x — y + bi + \- bn, then it is easy to see that x 

has the required properties. This uses the fact that every element of Z[l/q\ 
has p-adic absolute value less than or equal to 1 when p, q are distinct prime 
numbers. 



4 Exponential functions 

Let Z^^o % ^^<i X^i^o be formal power series. If we multiply these two 
series formally, then we get a power series J2'^^=o Cn-^", where 

n 

(4.1) Cn^Y.'^i^n-j 

for each n > 0. The sequence of c„'s is called the Cauchy product of the aj's 
and 6;'s. 

Now suppose that J2JLo % ^i^o convergent series of complex 

numbers. We can define the c„'s as in the preceding paragraph, and consider 
the series J2^=oCn- Does this series necessarily converge? If so, is the sum 
equal to the product of the sums of the a/s and 6;'s? 
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If there are only finitely many nonzero flj's and 6;'s, then this is simply 
an exercise in arithmetic. Suppose now that the a^-'s and 6;'s are nonnegative 
real numbers. It is easy to see that each partial sum of I]^o is less than 
or equal to 



(4.2) E^^ 

Hence Y^^=q c„ converges, and the sum is less than or equal to the aforemen- 
tioned product. One can show too that I]J^Lo is equal to the product of the 
sums of the a^-'s and fe^'s, because it is greater than or equal to the product 
of any of their partial sums. 

Using this one can check that if Z^^lo '^i ^IZo converge absolutely, 
then I]^o converges absolutely. Namely, one applies the previous case 
to \aj\, \bi\, and one notes that |c„| is less than or equal to the nth term of 
the Cauchy product of the absolute values of the a/s and ft^'s. One way to 
show that the sum of the c„'s is equal to the product of the sums of the a/s 
and the fe^'s is to decompose the series into linear combinations of series with 
nonnegative entries and apply the result already known for them. Another 
way is to approximate the series of a^-'s and 6/'s by finite sums. For finite 
sums we get the right answer by arithmetic, and the point is to check that 
small errors for the sums of the a/s and 6/'s lead to small errors for the sum 
of Cn's in a suitable manner. 

It is a nice exercise to check that Z^J^o converges, and that the sum is 
equal to the product of the sums of the a/s and 6/'s, if one of J^'j^o^j 
Y^iZo only finitely many terms and the other is a convergent series. A 

refinement of this states that if one of I^jlo'^j' J2'ilobi converges absolutely 
and the other converges, then I]^o converges and is equal to the product 
of the sums of the a^-'s and bi's. See |Tfj . 

A theorem of Abel states that if Sj^o ^j^ ^i^o and all converge, 

then the sum of the c„'s is equal to the product of the sums of the a^'s and 
his. To prove this, let r be a positive real number such that r < 1, and put 

oo oo oo 

(4.3) A{r)=Y^ajr\ B{r) =Y^hir\ C{r) = Y^Cr,r^. 

j=0 /=0 n=0 

If the Oj's and his are bounded, for instance, then the c„'s grow at most 
linearly, and the series in the definitions of A(r), B{r), and C{r) converge 
absolutely when < r < 1. We also have that 

(4.4) C{r) = A{r)B{r) 
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for < r < 1, because the series defining C(r) is the Cauchy product of the 
series defining A{r) and B{r). 

By definition, Abel summahility of Z^^o'^i' YldLo^i^ H'^=QCn means the 
existence of the hmits of A (r), i?(r), C(r) as r —> 1, < r < 1, in which case 
the Abel sum is defined to be the hmit. Ordinary convergence of an infinite 
series imphes Abel summability, with the Abel sum equal to the sum as the 
limit of the partial sums. The Abel sum of the Cauchy product is equal to 
the product of the Abel sums when they exist, and it follows that if the series 
converge, then the sum of the Cauchy product is equal to the product of the 
sums of the other two series. See 07j for more information. 

In the p-adic case the situation is simpler. The series Z^j^o'^i; H^oh 
converge if and only if 

(4.5) lim aj = lim bi = 0, 

j—*oo l^oo 

and in this event 

(4.6) lim c„ = 0, 

n— >oo 

as one can check. To see that the sum of the c„'s is equal to the product 
of the sums of the a^-'s and fc^'s, one can approximate by finite sums and 
show that the relevant error terms are small. This is analogous to one of the 
arguments for absolutely convergent series of real or complex numbers. 

Let us now consider the formal power series expansion for the exponential 
function, 

oo ~n 

(4.7) E{z) = Y.-- 

n=0 

As usual, n\ denotes "n factorial", which is the product of the integers from 
1 to n, and which is interpreted as being equal to 1 when n = 0. Formally 
we have that 

(4.8) E{z + w) = E{z)E{w), 

in the sense that if one expands the series and collect terms then the coeffi- 
cients match up, as a result of the binomial theorem. 

Let us focus first on the case of complex numbers. For each z G C one 
can show that the series defining E{z) converges absolutely, and indeed the 
terms tend to faster than a geometric series. In other words, the power 
series defining E{z) has infinite radius of convergence. Hence the formal 
identity ()4.8|) does work for the actual sums for all 2;, w G C, as a special case 
of the earlier discussion of Cauchy products. In particular, E{z) 7^ for all 
complex numbers z, with 1/E{z) = E{—z). 
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If X is a nonnegative real number, then E{x) is real and E{x) > 0. If x is 
a real number and x < 0, then E{x) is a real number such that < E{x) < 1, 
since E{x) = l/E[—x). If ^ is a complex number, then the complex conjugate 
of E{z) is equal to E{z). This follows from the series expansion for E{z), 
since the coefficients are real numbers. As a consequence we get that 

(4.9) l^(^) r = E{z) E{z) = E{z + z)= E{2 Re z) 

for every complex number z. 

For the p-adic case we should begin by considering the number of factors 
of p in n!. The number of positive integers less than or equal to n which are 
divisible by p is equal to the integer part of n/p. For each positive integer k, 
the number of positive integers less than or equal to n which are divisible by 
p^ is equal to the integer part of n/pk- The total number of factors of p in 
n! is equal to the sum of the integer parts of n/p^ over all positive integers 
k. This sum is less than n/{p — 1), by comparison with a geometric series. 

It follows that the series for E{x) converges in Qp when x G Qp has p- 
adic absolute value less than p~^/(p~^). This discussion follows the one in [26] 
starting on pll2. As explained very nicely there, this condition may seem a 
bit strange, since — 1) is an integer only when p = 2, but in fact one may 
wish to consider E{x) on extensions of Qp where the extension of the p-adic 
absolute value has nonzero values other than integer powers of p. A related 
point is that there can be Galois actions on such an extension which then 
interact with the exponential in a nice way, just as complex conjugation does 
in the complex case. At any rate, because of the ultrametric property for 
the p-adic absolute value, a disk around is closed under addition, and one 
again has the identity that the exponential of a sum is equal to the product 
of the corresponding exponentials. 



5 Normed vector spaces 

Let V he a. vector space over the real or complex numbers. By a seminorm 
on V we mean a nonnegative real-valued function N{v) on V such that 

(5.1) N{av) = \a\N{v) 

for all real or complex numbers a, as appropriate, and all v E V, and such 
that 

(5.2) N{v + w)<N{v)+N{w) 
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for all v,w G V. 

Recall that a subset £^ of is said to be convex if for every pair of vectors 
v,w E E and every real number t with < t < 1 we have that 

(5.3) tv + {l-t)w e E. 

Under the homogeneity condition ()5.1|) . one can check that the triangle in- 
equality ()5.2|) holds if and only if 

(5.4) {veV : N{v) < 1} 

is a convex subset of V. 

If N{v) is a seminorm on V, and if N{v) > for a\\ v & V with f 7^ 0, 
then we say that N{v) is a norm on V. In this event we get a metric on 
given by N{v — w). 

As a special case, suppose that {v,w) is an mner product on y, or more 
precisely a hermitian inner product in the complex case. This means that 
(f , w ) is a real or complex- valued function, according to whether y is a real 
or complex vector space, defined for v,w &V, such that {v,w) is a linear 
function of v for each w E V, 

(5.5) {w, v) = {v, w) 
when is a real vector space and 



(5.6) {w, v) = {v, w) 

when y is a complex vector space, and {v, v) is a nonnegative real number 
for all f G which is equal to if and only if f = 0. If we put 

(5.7) \\v\\ = {v,vY'\ 
then the Cauchy-Schwarz inequality states that 

(5.8) |(f , w)\ < \\v\\ \\w\\ 

for all v,w E V. This can be verified using the fact that {v + aw,v + aw) 
is a nonnegative real number for all scalars a. Using the Cauchy-Schwarz 
inequality one can check that 

(5.9) ll^^ + wf < (||i;|| + ||u;||)2 
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for all v,w &V, so that \\v\\ does in fact define a norm on V. 
If = R", then the standard inner product on V is given by 

n 

(5.10) {v,w) = Y,VjWj. 

li V = C", then the standard Hermitian inner product is defined by 

n 

(5.11) {v,w) = Y,VjW]. 

i=i 

The associated norm is the same as ||f II2 defined next. 
Let p be a real number with 1 < p < 00, and put 



(5.12) 




ioT V = (f 1, . . . , Vn) in R" or C". We can extend this to j9 = 00 by setting 

(5.13) \\v\\oo = max(|fi|, . . . , \vn\). 

For 1 < p < 00 we have that ||f ||p satisfies the homogeneity condition (j^HJ, 
and is equal to if and only if u = 0. When p = l,oo one can check the 
triangle inequality directly from the definitions, and when p = 2 this follows 
from the preceding discussion about inner product spaces. In general when 
1 < p < 00 one can check that the closed unit ball associated to is a 
convex set, and hence that \\v\\p defines a norm, using the convexity of the 
function defined on the nonnegative real numbers. 
For 1 < p < 00 we have that 

(5.14) \\v\\^ < \\v\\p 

for all V in R" or C", by inspection. Using this one can verify more generally 
that 

(5.15) ||f 11, < \\v\\p 

for all V in R" or C" when 1 < p < q < 00. 
If r is a real number with r > 1, then 



(5.16) 
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for all nonnegative real numbers Xi, . . . , x„, by the convexity of the function 
on the nonnegative real numbers. As a result, 

(5.17) \\v\\p < n'/^-^/'^ \\v\\g 

when V is an element of R" or and 1 < p < g < oo. This also works with 
q = oo, 1/q = 0, hy inspection. 

Suppose that N{v) is a seminorm on or C". One can check that N(v) 
is bounded by a constant times the Euclidean norm ||i'||2, or any other 
if one prefers, where the constant can be estimated in terms of the values of 
N at the standard basis vectors. 

On any real or complex vector space V, ii N{v) is a. seminorm on V, then 

(5.18) N(v) <N(w) + N(v-w) 
for all V, w e y. Similarly, 

(5.19) N{w) <N{v) + N{v-w) 
for all V, w e y, and therefore 

(5.20) \N{v) - N{w) \<N{v- w) 

for all v,w ^ V. If is a norm, then is continuous with respect to the 
metric associated to A^. If V is R" or C", then A^ is also continuous with 
respect to the standard Euclidean metric, using the remark in the previous 
paragraph. It follows that there is a positive real number rj such that N{v) > 
T) when \\v\\2 — 1, which is to say that v lies on the standard Euclidean sphere, 
since N{v) is a positive continuous function on the sphere and the sphere is 
compact. 

One could define the notion of a norm just as well on a vector space over 
a subfield of the real or complex numbers, like the rational numbers. One 
should be a bit careful, in that for instance if a is an irrational number, then 
N{x) — \xi — ax2\ defines a norm on which is more degenerate than 
norms on R^ or C^. 

Instead one might consider vector spaces defined over the rational or p- 
adic numbers with respect to the ]9-adic absolute value function on scalars. 
In this case one might consider the usual triangle inequality for seminorms, 
as above, or the stronger "ultrametric" version requiring that the seminorm 
applied to a sum of two vectors is less than or equal to the maximum of 
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the values of the seminorm at the two vectors. Assuming homogeneity, this 
stronger ultrametric version of the triangle inequality is equivalent to saying 
that the set of vectors in the space with seminorm less than or equal to 1 is 
closed under addition. 

More generally one might consider vector spaces over fields with absolute 
value functions, including extensions of the p-adic numbers, as in ^21 120] ■ 
This may involve fields which are not locally compact, and sometimes one is 
interested in completeness instead, as in jT21 1211 • 

6 Dual spaces 

Let \^ be a finite-dimensional real or complex vector space, and let V* denote 
the dual vector space of linear functionals on V. Thus V* consists of the linear 
mappings from V into the real or complex numbers, whichever is the scalar 
field for V. One can add linear functionals and multiply them by scalars, so 
that V* is indeed a vector space with the same field of scalars as V. 

Suppose that f i, . . . , f„ is a basis for V, so that any vector v & V can be 
expressed in a unique way as a linear combination of the fj's. If A is a linear 
functional on V, then A is uniquely determined by A(f i), . . . , A(f„), since 
X{v) for any v E V can be computed from the knowledge of these quantities 
and the coefficients of v in the basis. Furthermore, for any collection of n 
scalars ai, . . . , there is a linear functional A on y such that X{vj) = aj 
for each j. In particular, the dimension of V* is equal to the dimension of V. 

Now suppose that V is also equipped with a norm \\v\\. Let A be any 
linear functional on V, and put 



To see that this is finite one can use an isomorphism between V and R" or 
C", as appropriate, and the fact that ||f || is equivalent to a standard norm 
given explicitly in terms of coordinates of vectors, as in Section |3| 

Equivalently, ||A||* can be characterized as a nonnegative real number 
such that 



for a\\ V E V and ||A||* is as small as possible. One can check that ||A||* 
defines a norm on the dual space V*, called the dual norm associated to the 
norm ||f || on V. 



(6.1) 




(6.2) 
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For instance, let V he a real or complex vector space equipped with 
an inner product {v,w). For each w & V, X{v) = {v,w) defines a linear 
functional on V. Using the Cauchy-Schwarz inequality one can check that 
the dual norm of A is equal to the norm of w, with respect to the norm on 

V associated to the inner product. 

Now let n be a positive integer, and let V be R" or C". A linear functional 
A on \^ can be represented explicitly as 

n 

(6.3) X{v) = J2vj^j^ 

V = (vi, . . . ,Vn), where w = {wi, . . . ,Wn) is an element of R*^ or C", as 
appropriate. If V is equipped with the norm ||f||i as in Sectional then we 
have that |A(f)| < ||w||oo ll'^^lli for all v E V, just by the triangle inequality. 
One can also verify that ||u'||oo is the smallest nonnegative real number with 
this property. In other words, if we use the norm || ■ ||i on V, then the dual 
norm is given by || ■ ||oo- 

Now suppose that we use the norm UtiHoo on V^. As in the previous 
paragraph we have that |A(f )| < ||f ||oo for all f G V" when A is associated 
to w as in ()6.3|) . by the triangle inequality, and that \\w\\i is the smallest 
nonnegative real number with this property, so that the dual norm of A is 
exactly \\w\\i. 

Let p, q be real numbers with 1 < p,q < oo and 1/p + 1/g = 1. In this 
case we say that p, q are conjugate exponents. One can check that 

6.4 ab<— + — 

P Q 

for any nonnegative real numbers a, b, and indeed one can view this as a 
consequence of the convexity of the exponential function on the real line. 
If ai, . . . , a„ and 6i, . . . , 6„ are nonnegative real numbers, then Holder's in- 
equality states that 

n . n \ 1/p / " \ l/<? 

j=l ^k=l ^ ^1=1 ^ 

This follows from the previous inequality when Ylik^A^ — 1 ^^"^ ^i^l ^ 1; 
just by applying the inequality termwise and summing, and one can derive 
the general case from this by a scaling argument. 
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If we use the norm ||f ||p on V , and if the hnear function A is associated 
to an n-tuple w as before, then we have that |A(f )| < \\w\\g \\v\\p for all f G K 
by Holder's inequality. For a fixed w one can choose f 7^ so that this 
inequality becomes an equality, as one can check. As a result, the dual norm 
of A associated to the norm \\v\\p on V is equal to \\w\\q. 

7 Operator norms 

Let K be a finite-dimensional real or complex vector space, and let JC.{V) 
denote the collection of linear mappings from V into itself. Thus C{V) is a 
vector space in a natural way, since one can add linear transformations on 
V and one can multiply them by scalars. Moreover one can compose lin- 
ear transformations on V, which provides a kind of multiplication on C{V), 
making it an algebra rather than simply a vector space. The identity trans- 
formation I on V, which sends each vector v ^ V to itself, acts as a multi- 
plicative identity element in the algebra, since the composition of any linear 
transformation T onV with I is equal to T. 

Let vi, . . . ,Vnhe a, basis for V. If T is a linear transformation on V, then 
T is uniquely determined by its values on the f j's. The image of each vj under 
T is a vector in V and therefore characterized by its n coefficients with respect 
to the basis Vi, . . . ,Vn. Conversely one can start with ri^ scalars, which can 
be arranged naturally into an n x n matrix, and get a linear transformation 
T on y for which the given scalars are the coefficients of the T(i>j)'s in the 
basis. In particular jC{V) has dimension as a vector space. 

Suppose that V is equipped with a norm \\v\\. If T is a linear transfor- 
mation on V, then put 

(7.1) ||T||„p = sup{||T(^;)|| -.veV, \\v\\ < 1}. 

That this is finite can be seen using an isomorphism between V and R" or 
C", as appropriate, and the equivalence of with a standard norm given 
in terms of coordinates. 

One can also characterize ||T||op, called the operator norm of T associated 
to the norm II^H on V, as the smallest nonnegative real number such that 

(7.2) |TO||<||Ti|,||^|| 

for all V & V. It is easy to verify that this does indeed define a norm on 
the vector space of linear transformations on V, and it enjoys the additional 
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property that 

(7.3) ||^l0^2||op< ll^lllop ||r2||op 

for any linear operators Ti, T2 on V . 

Of course the norm of the identity transformation / on is equal to 1. 
If T is an invertible linear transformation on V, so that there is a linear 
transformation on V whose composition with T is equal to /, then 



(7.4) l = ||J|Up<||T||,p||T- 



I op. 



Suppose that A is a linear transformation on V", f is a nonzero vector in V, 
and that a is a real or complex number, as appropriate. We say that v is an 
eigenvector for A with eigenvalue a if 

(7.5) A{v)=av. 
In this event 

(7.6) |a| < \\A\\op. 

Let V be R" or C", and let ei, . . . , denote the standard basis vectors for 
V, which is to say that the /th component of Cj is equal to 1 when / = j and is 
equal to otherwise. Suppose that we use the norm ||f ||i from Section El for 
V. If T is a linear operator on V, then the operator norm of T with respect 
to this norm on V is equal to the maximum of the norms of T(ei), . . . , T(e„). 
This is not difficult to verify just from the definitions. Of course this can 
be expressed explicitly in terms of the absolute values of the entries of the 
matrix associated to T with respect to the standard basis of e/s. 

Suppose instead that we use the norm H^Hoo from Sectional We can think 
of T as being described by n linear functionals Ai, . . . , A„ on V, where Xj{v) 
is the same as the jth component of T{v) for all v & V. The operator norm of 
T with respect to the norm \\v\\oo on V is equal to the maximum of the dual 
norms of Ai, . . . , A„, as one can easily verify. The dual norm associated to 
II V II 00 was determined in the previous section, and thus the operator norm of 
T can again be given explicitly in terms of the absolute values of the matrix 
entries of T with respect to the standard basis in this case. 

Let T be a linear operator on V whose operator norm with respect to 
each of || ■ ||i and || ■ ||oo is less than or equal to 1. In other words, assume 
that 

(7.7) ||T(t;)||i<||t;||i 
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and that 

(7-8) ||T(^;)||oo< ||^;||oo 

for all V E V. In terms of the matrix of T associated to the standard basis 
ei, . . . , e„, this is equivalent to saying that the sum of the absolute values of 
the matrix entries in any row or column is less than or equal to 1. A result 
of Schur implies that 

(7.9) ||T(^;)||,<||^;||, 

for any p, 1 < p < oo, and all v &V, which is to say that the operator norm 
of T with respect to || • ||p is also less than or equal to 1. 

To show this we may as well assume that the matrix entries of T are 
nonnegative real numbers, and we may as well restrict our attention to vectors 
V whose components are nonnegative real numbers. In other words, we can 
reduce to this case by putting in absolute values everywhere and applying the 
triangle inequahty repeatedly. The hypotheses on T still hold if we replace T 
with the linear transformation whose matrix entries are the absolute values 
of the matrix entries of T. 

Under these conditions, one can check that the pth power of the jth 
component of T{v) is less than or equal to the jth component of T apphed 
to the vector given by the pth power of the components of v. This follows 
from the convexity of the function on the nonnegative real numbers, using 
the fact that the operator norm of T with respect to || ■ ||oo is less than or 
equal to 1. Because T has operator norm less than or equal to 1 with respect 
to II • 111, it follows that the sum of the pth powers of the components of T{v) 
is less than or equal to the sum of the pth powers of the components of v. 
This says exactly that 117(^)11^ < \\v\\^, as desired. 

As another special case, let T be a linear operator on R" or C" which is 
diagonalizcd by the standard basis ei, . . . , e„. That is, wc assume that there 
are real or complex numbers cti, . . . , «„, as appropriate, so that 

(7.10) T{ej)^ajej 

ioT j — 1, . . . ,n. In this event the operator norm of T is equal to 

(7.11) max(|Q;i|, . . . , |q;„|) 

with respect to any of the norms ||t'||p, l<p<oo. 

Now suppose that y is a finite- dimensional real or complex vector space 
equipped with an inner product {v,w), and let ^4 be a linear transformation 
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on V. We say that A is self-adjoint if 

(7.12) {A{v),w) = {v,A{w)) 

for all v,w & V. A famous theorem states that A can be diagonahzed in 
an orthonormal basis in this situation. In other words, there exist vectors 
f 1, . . . , VninV and real numbers such that 

(7.13) {vj,vi) = 

when j 7^ I, {vj,Vj) = 1 for all j, every element of V can be expressed as a 
linear combination of the f/s, and A{vj) = cxjVj for each j. The operator 
norm of A is then equal to the maximum of . . . , with respect to the 
norm associated to the inner product. 

8 Geometry of numbers 

Consider Q", the subset of R"' consisting of points with rational coordinates, 
and suppose that that N{v) is a seminorm on Q". As in Section|31 this means 
that N{v) is a nonnegative real- valued function defined for v G Q" such that 
N{av) = \a\ N{v) for all a G Q and v eV, and N{v + w) < N{v) + N{w) for 
3l\v,w e V. Because A^(f) < N{w) + N{v-w) aiadN{w) < nIv) + nIv-w) 
for all V, w G Q", we have that 

(8.1) \N{v) - N{w)\ < N{v -w) 

for all v,w E Q", as before. We also have that N{v) is bounded by a constant 
multiple of the Euclidean norm on Q", with an estimate in terms of N{ej), 
1 < J < 'T-, where the e^-'s are the standard basis vectors in R". It follows 
that N{v) is a uniformly continuous function on Q", and therefore has a 
unique continuous extension to a function on R" which is in fact a seminorm 
on R". 

The extension of to R" may or may not be a norm, even if A^ is a 
norm on Q". For if 6 is any real number, then N{v) = \vi — 6v2\ defines a 
seminorm on R^ and on by restriction. As in Section if ^ is irrational, 
then N{v) is a norm on Q^, but it is not a norm on R^ for any 6. At any rate, 
if we start with a norm on Q" for some n, we can extend it to a seminorm 
on R", and it is interesting to consider the interplay between the norm on 
Q" / seminorm on R" and arithmetic. 
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Suppose now that U is an open subset of R", and let Z" denote the 
subset of R" of points with integer coordinates, which is of course closed 
under addition. If the volume of U is strictly larger than 1, then there are 
points x,y G U with x ^ y and x — y E Z"^. To see this it is convenient 
to think of the quotient of R" by Z", which is a torus whose total volume 
is equal to 1, and the natural projection p from R" onto the quotient. The 
existence of distinct points x, y in U whose difference is an element of Z" is 
equivalent to saying that the restriction of p is not one-to-one, which follows 
immediately if the volume of U is strictly larger than the volume of the 
quotient torus, which is equal to 1. 

Assume further that U is symmetric about the origin, so that w e U 
implies —w G U. We can rephrase the previous conclusion then to say that 
there are points x,y E U such that x + y is a nonzero element of Z". If 
U is also convex, then [x + y)/2 is a nonzero element of U which lies in 
(1/2)Z", which is to say that its coordinates are integers or half-integers. We 
can rephrase this again by saying that if is a convex open subset of R" 
which is symmetric about the origin and which has volume strictly larger 
than 2", then U contains a nonzero element of Z". This is a version of the 
basic existence result in the geometry of numbers. 

9 Linear groups 

Let y be a finite-dimensional real or complex vector space, and let GL(V) 
denote the group of invertible linear transformations on V. Of course any 
nonzero multiple of the identity operator / is invertible. As a subset of the 
vector space C{V) of linear transformations on V, GL{V) is open, since it 
consists simply of the linear transformations with nonzero determinant. 

We can also look at this in terms of norms. Let | It'll be a norm on V, so 
that we have an associated norm ||T||op for linear operators on V. If T is an 
invertible linear transformation on V, then there is a real number c > such 
that 



for all V E V, namely, c = 1/||T ^||op. If ^4 is a linear operator on V such 
that \\A\\op < c, then 



(9.1) 



c\\v\\ < \\T{v)\\ 



(9.2) 
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for all V & V. It follows that T + A is injective, and hence invertible, since 
V is assumed to have finite dimension. 

Once we specify a norm ||f || on V, we get a nice subgroup of GL{V), 
namely the group of linear isometrics on V, which are the linear mappings 
T from V to itself such that 

(9.3) \\T{v)\\ = \\v\\ 

for all V E V. This is the same as saying that both T and have norm 
equal to 1, and of course the identity operator I is always an isometry. The 
group of isometrics on is a compact subset of GL{V), because it is closed 
and bounded. If the norm on V comes from an inner product, then the group 
of isometrics is quite rich, and is known as an orthogonal or unitary group, 
according to whether y is a real or complex vector space. 

Let us take V = R", and consider the group of invertible linear transfor- 
mations T on R" which map Z" onto itself. A linear mapping T on R" maps 
Z" into itself if and only if the matrix associated to T and the standard basis 
ei, . . . , e„ in R" has integer entries. In order that T be an invertible linear 
transformation on R" which takes Z" onto itself the matrices associated to 
both T and should have integer entries. This is equivalent to saying that 
the matrix associated to T has integer entries and determinant equal to ±1. 

Now consider Q^, the space of n-tuples of p-adic numbers, as a vector 
space over Qp with respect to coordinatewise addition and scalar multipli- 
cation. The group of invertible linear transformations on is described by 
the condition that the determinant is nonzero, and is an open subset of the 
vector space of all linear transformations on Qp, which can be identified with 
Qp . A natural norm on is given by 

(9.4) N{v) ^max{\vi\p,...,\vn\p) 

for v = {vi, . . . , Vn) e Qp. The linear mappings T on ^ which are isometrics 
with respect to this norm can be characterized by the condition that the 
matrix with respect to the standard basis has entries in Zp and the deter- 
minant has j9-adic absolute value equal to 1, so that the inverse matrix also 
has entries in Zp. This is a compact subgroup of the group of all invertible 
linear transformations on Qp. 
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10 Trace norms 



Let V be a finite- dimensional real or complex vector space equipped with an 
inner product {v, w). As before, a linear transformation ^4 on y is self-adjoint 



for all v,w eV. In this case A can be diagonahzed in an orthonormal basis 
for V, which is to say that there is an orthonormal basis Vi, . . . ,Vn for V and 
real numbers ai, . . . ,an such that A{vj) = aj Vj for j = 1, . . . ,n. The a/s 
are the eigenvalues of A, and it is easy to verify directly that the eigenvalues 
of A are real numbers even if F is a complex vector space. Notice that the 
self-adjoint linear operators on V form a real vector space, which is to say 
that the sum of two self-adjoint linear operators on F is a self-adjoint linear 
operator on V and that the product of a real number and a self-adjoint linear 
operator on V is a self-adjoint linear operator on V, and that one should use 
real scalars for this even if y is a complex vector space. 

For A as in the previous paragraph and p e R such that 1 < p < oo, put 



This is the Schatten p-dass or Sp norm of A, although the fact that the 
triangle inequality holds for the Sp norm is somewhat tricky and will be 
discussed further in a moment. When p = oo one can define the norm 
of A to be the maximum of the [ctjl's, which is the same as the ordinary 
operator norm of A. 

Let Wi, . . . ,Wn be another orthonormal basis for V, in addition to the 
orthonormal basis Vi, . . . ,Vn oi eigenvectors for A, and consider 



if 



(10.1) 




(10.2) 






1=1 



This is equal to when wj = Vj for each j. In general. 



n 





since A{w) = Y^]=i O-j {w, Vj) Vj for all w E V. 
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Because f i, . . . , f„ is an orthonormal basis, J21'=i I {"Vj, p 



\w 



P for all 



w eV. Similarly, Yh=i\{v,uii) 



P for all V E V. It follows that the 



sum of I {vj, w;) p over j for a fixed I, or the sum over / for a fixed j, is equal 
to 1. 

By the result of Schur mentioned in Section [7| we get that 



(10.5) 



1=1 j=i 



< E 



i/p 



for all real or complex numbers Xi, . . . ,a;„ and 1 < p < oo. This implies 
that p0.3|) is always less than or equal to H^H^p for any orthonormal basis 
Wi, ... ,Wn on V. Therefore || ^Hs^ is the same as the maximum of ()10.3|) over 
all orthonormal bases Wi, ... ,Wn on V, and it follows that || ^H^^ is indeed a 
norm on the real vector space of self-adjoint linear operators on V. 



11 Vector spaces 

Let khe a field, and let be a vector space over k. If fi, . . . , w„ are elements 
of V, then their span is denoted span(fi, . . . ,Vn) and consists of all linear 
combinations 

(11.1) aifiH \-anVn 

of vi, . . . ,Vn, with a„ G A;. The span of f i, . . . , f„ is a linear subspace 

of V, which is to say that it is closed under addition and scalar multiplication. 

A collection of vectors Vi, . . . ,Vn in V is said to be linearly independent 
if the linear combination (jll.lj) of fi, is equal to if and only if the 

scalars aj are all equal to 0, 1 < j < n. This is equivalent to saying that each 
element of the span of fi, . . . ,f„ can be expressed as a linear combination 
of Vi, . . . ,Vn in a unique manner. A collection of vectors vi, . . . ,Vn in V is 
said to be linearly dependent if the fj's are not linearly independent. This is 
equivalent to saying that one of the fj's lies in the span of the others. 

Suppose that f i, . . . , f„ and wi, . . . , Wm are vectors in V, with each vj 
an element of the span of wi, . . . ,Wm- If n > m, then the u/s are linearly 
dependent. Basically this reduces to the fact that a homogeneous system of 
m linear equations and n variables has a nontrivial solution when n> m. 

We say that the vector space V has finite dimension if there is a finite 
collection of vectors in V which span V . The smallest number of vectors in V 
needed to span V is called the dimension of V . If V is the span of f i, . . . , f„ 
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and n is as small as possible, then vi, . . . ,f„ are linearly independent. A 
collection of vectors vi,...,VninV which are linearly independent and whose 
span is equal to V is called a basis for V. 

12 Algebras 

Let A; be a field, and let A be an algebra over k. This means that A is 
a vector space over k equipped with a binary operation of multiplication 
which is associative and which is bilinear with respect to the vector space 
operations on A. We shall also assume that A has a nonzero multiplicative 
identity element e, and hence that A contains a copy of k. 

Of course k itself is a 1-dimensional algebra over k. If is a nonempty 
set, then the vector space J^{E, k) of k-velued functions on E is an algebra 
over k, with the constant function equal to 1 at each point as the identity 
element. If V is a vector space over k. V ^ {0}, then the algebra C{V) of 
linear operators on V is an algebra over k, with the identity operator / as 
the multiplicative identity element. Notice that for any algebra A over k 
with nonzero identity element, one can embed A into the algebra of linear 
operators on V = A, viewed simply as a vector space. Namely, each clement 
a; of ^ induces a linear transformation on A given by left multiplication by 

X. 

An algebra A over k is said to be commutative if the operation of multi- 
plication on A is commutative, i.e., ii xy — yx ior all x,y & A. Let A be 
an algebra over k with nonzero multiplicative identity clement e, which may 
or may not be commutative, and let x be any element of A. One can get a 
subalgebra of A consisting of linear combinations of powers of x and e, and 
this subalgebra is automatically commutative. 

13 Eigenvalues 

Let k he a field and let y be a vector space over k with positive finite 
dimension equal to n. Suppose that A is a linear transformation on V. 
An element « of /c is said to be an eigenvalue for A if there is a nonzero 
eigenvector corresponding to A, i.e., a vector v E V , v 0, such that 

(13.1) A{v)^av. 
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In this case A — al has nontrivial kernel, where / is the identity operator 
on V, and hence is not invertible. Conversely, if A — al is not invertible for 
some a & k, then A — a I has nontrivial kernel, since V is assumed to be 
finite-dimensional, and this implies that a is an eigenvalue for A. 

As a vector space over k, the algebra JC{V) of linear transformations on 
V has dimension equal to n^. If T is a linear transformation on V, then 
there is a positive integer / < such that T' can be expressed as a linear 
combination of the T-^'s with 1 < j < I and the identity operator /, since 
otherwise JC{V) would have dimension larger than n^. Here refers to the 
product of j T"s when j is a positive integer. The Cayley-Hamilton Theorem 
gives a more precise version of this, with expressed as a linear combination 
of , 1 < j < n, and the identity operator. 

At any rate there is a positive integer / and scalars cq, ■ ■ ■ ,ci-i e k such 
that 

(13.2) T' = Q_iT'-^ + --- + ciT + co/. 

If T is invertible and / is as small as possible, then cq ^ 0, because otherwise 
we could remove a factor of T and express T'~^ as a hnear combination of 
smaller powers of T and the identity operator. As a result, if T is an invertible 
linear transformation on then the inverse of T can be expressed as a linear 
combination of powers of T and the identity operator on V . 

Let A be a linear operator on V again, and let A denote the commutative 
subalgebra of consisting of linear operators on V which can be expressed 
as a linear combination of powers of A and the identity transformation. Prom 
the previous remarks it follows that if T G ^ and T is invertible as a linear 
operator on V ^ then the inverse of T also lies in A. 

Suppose that a G /c is an eigenvalue of ^4, and let v be a nonzero vector 
in V which is an eigenvector for A with eigenvalue a. Of course v is also an 
eigenvector for the identity operator with eigenvalue 1, and v is an eigenvector 
for A^ with eigenvalue for every positive integer j. If T is any element of 
A^ then V is an eigenvector for T with eigenvalue that we shall denote 4>{T). 
One can check that defines an algebra homomorphism from A onto k. 

Conversely, suppose that we start with a homomorphism from A into 
k which is not identically equal to 0. It follows that (f)(1) — 1, and that 
(f) maps A onto k. If T is any element of A which is invertible as a linear 
transformation on V, and hence has its inverse in A, then 4>{T) ^ 0. 

Put a = 4>{A). It follows that applied to /I — a / is equal to 0, and 
hence A — al is not invertible, which is to say that a is an eigenvalue of 
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A. In short, the set of eigenvalues of A is equal to the set of values of 
nonzero homomorphisms from A into k at A. Notice also that each nonzero 
homomorphism from A into k is determined by its value at A. Therefore the 
number of nonzero homomorphisms from A into k is equal to the number of 
distinct eigenvalues of A. 

14 Polynomials, formal power series 

Let k he a field, and let us write Eo(A;), for the spaces of sequences 

Wj}'jLo with aj G k for all j and with aj = for sufficiently large j, depending 
on the sequence, in the case of S(A;). Thus T^o{k), S(A;) are vector spaces over 
k with respect to termwise addition and scalar multiplcation, and So(^) is a 
hnear subspace of T,{k). If {o^I^q, {&i}^o sequences in T,{k), then their 
Cauchy product is the sequence {c„}^q in S(A;) defined by 

n 

(14.1) Cn^Yl % ^n-j 

j=0 

for all n > 0, and this hes in So(A;) if {aj}°^Q and {fe^j^o With respect 
to this product, So(^) and S(A:) become commutative algebras over k. 

Let us write V{k), VS{k) for the algebras of polynomials and formal 
power series over k. Thus an element of V{k) can be expressed as 

(14.2) a„r + a„_ir-^ + --- + ao 

for some cq, . . . , c„ e /c, while an element of VS{k) can be expressed as 

oo 

(14.3) ^a,i^- 

3=0 

for some sequence of coefficients Oq, ai, . . . in k. Here t is an indeterminant, 
and one can add and multiply polynomials or power series in the usual man- 
ner, so that V{k) and VS{k) are commutative algebras over k. Of course 
V{k) is the subalgebra of VS{k) with all but finitely many coefficients equal 
to 0. 

There is a natural one-to-one correspondence between S(/c) and VS{k), 
in which a sequence with terms in k is associated to the formal power series 
with that sequence of coefficients. In this correspondence Eo(A;) is mapped 
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onto V{k). More precisely this defines an isomorphism between T,Q{k) and 
V{k), and between T,{k) and VS{k), as algebras over k, which is to say that 
addition and multiplication are preserved. 

Suppose that A is any algebra over k with nonzero multiplicative identity 
element e, and let x be an element of A. If p{t) is a polynomial over k, with 
p(t) given by (jl4.2j) for some Oq, • • • , ctn ^ k, then we can define p{x) to be 
the element of A given by 

(14.4) a„ + a„_i a;""^ H h ao e. 

Notice that if Pi{t), P2{t) are polynomials over k, then (pi +P2)(a^) = Pi{x) + 
^2(3;) and (piP2)(a;) = Pi{x)p2{x). As a special case, we can take A equal to 
k, viewed as a one-dimensional algebra over itself, and then we are simply 
saying that a formal polynomial p{t) G V{k) defines a function on k in the 
usual manner. As another special case, we can take A to be the algebra 
of linear transformations on So (A;) or S(/c) viewed as vector spaces, we can 
take X to be the linear transformation corresponding to multiplication by the 
indeterminant t, and then p{x) is the linear transformation corresponding to 
multiplication by p{t). 



15 Some other sums 



Consider the vector space over C of doubly-infinite sequences {aj}j2=_oo of 
complex numbers such that aj = when \j\ is sufficiently large, where the 
vector space operations are termwise addition and scalar multiplication, as 
usual. If {aj}j^_oo5 {^1} 1^-00 two such sequences, then we can define the 
Cauchy product to be the sequence {c^jj^.^o of complex numbers with 

00 

(15.1) Cn= J2 (^j^n^r 

j=-oo 

For each integer n, this sum has only finitely many nonzero, and so it makes 
sense. One can also check that c„ = when \n\ is sufficiently large, so that 
{cn}'^=-oo li^s in the vector space under consideration. In this way we get a 
commutative algebra over the complex numbers. 

If {aj}'jL_^ is a sequence of this type, then we can define a function 
associated to it on the non-zero complex numbers by 

00 

(15.2) 

j=-oo 
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Again the sum makes sense because all but finitely many of the terms are 
equal to 0. Thus we get a mapping from our algebra to functions on C\{0}, 
and it is easy to see that this is an algebra homomorphism, i.e., this mapping 
is linear and it takes Cauchy products of sequences to ordinary products of 
functions on C\{0}. 

Now consider doubly-infinite sequences of complex numbers {aj}'^i such 
that 

oo 

(15.3) ^ \aj\ < oo, 

j=-oo 

which is the same as saying that the partial sums J2]=-n l%l bounded. 
In other words the series corresponding to the sequence should converge 
absolutely. As before the space of these sequences is a vector space over the 
complex numbers. We can define a norm on this space by saying that the 
norm of such a sequence is equal to the sum of the moduli of its terms. In this 
way our vector space becomes a Banach space over the complex numbers, 
which is to say that it is a normed vector space which is complete with respect 
to the metric associated to the norm. 

If {aj}j^_oo) {^i}i^~oo ^"^o sequences of this type, with absolutely 
summable terms, then we can define the Cauchy product {cn}5^_oo the 
same manner as before. The series used to define c„ converges absolutely for 
each n e Z. Moreover, one can show that {cn}^=-oo ^Iso lies in our space, 
so that the Cauchy product makes our vector space a commutative algebra 
over the complex numbers. The norm of the Cauchy product is less than or 
equal to the product of the norms of the sequences used in the product, and 
thus we have a commutative Banach algebra. 

If {djj'jL-oo is one of these doubly-infinite sequences of complex numbers 
whose terms are absolutely summable, then we can associate to it the function 
on the unit circle given by 

GO —1 

(15.4) E%^'+ E 

j=0 i=-oo 

This is equivalent to the earlier formula, because = z when \z\ = 1, 
and we need to restrict our attention to complex numbers z with \z\ = 1 to 
ensure that the series converges. The absolute summabihty of the coefficients 
Qj implies that the partial sums of this series converge uniformly on the unit 
circle as in the Weierstrass M-test, and hence the sum defines a continuous 
function on the unit circle. The mapping from sequences to functions is again 
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an algebra homomorphism, which is to say that it is hnear and takes Cauchy 
products of sequences to products of functions on the unit circle. 

A Metric spaces 

By a metric space we mean a nonempty set M together with a real-valued 
function d{x, y) defined for x,y E M such that d{x, y) > for all x,y E M, 
d{x, y) = if and only ii x — y, d{x, y) — d{y, x) for all x,y & M, and 

(A.l) d{x,w)<d{x,y) + d{y,w) 

for all x,y,w e M. This last property is called the triangle inequality for the 
metric d{x,y) on M. If {M,d{x,y)) is a metric space and £^ is a nonempty 
subset of M, then we can view as a metric space in its own right, using 
the restriction of the metric d{x, y) to x,y & M. 

Suppose that (M, d{x, y)) is a metric space and that {xj}^^ is a sequence 
of points in M. We say that {xj}^i is a Cauchy sequence in M if for every 
e > there is a a positive integer L such that 

(A. 2) d{xj,xi) < e 

for all j, I > L. We say that {xj}'jli converges to a point x e M if for every 
e > there is a positive integer L such that 

(A. 3) d{xj, x) < e 

for all j > L. One can check that the limit of a convergent sequence is 
unique, which is to say that if a sequence {x^y^jL^ of points in M xonverges 
to a; G M and to x' G A/, then x' = x. If {xj}°^^ is a sequence of points in 
M which converges to a point x G M, then we write 

(A. 4) lim x-j — X. 

It is easy to see that a convergent sequence in a metric space is a Cauchy 
sequence. Roughly speaking, the property of being a Cauchy sequence cap- 
tures the information of convergence without having a limit. A metric space 
is said to be complete if every Cauchy sequence in the space has a limit. 

Let {M,d{x,y)) be a metric space. Suppose that {xj}'^^, {yjj'^i are 
sequences in M such that 

lim d(xj,yj) = 
34 



(A.5) 



as a sequence of real numbers. One can show that if {xj}'^^ is a Cauchy 
sequence, then the same is true of Similarly, if {xj}'^i converges in 

M, then converges too, and to the same point. 

A subset of M is said to be dense in M if for every point x E M there 
is a sequence {xj}JLi of points in E which converges to x. This is equivalent 
to saying that for each x E M and each positive real number e there is a 
point y E E such that d{x, y) < e. Indeed, when this condition holds, one 
can simply choose Xj G E for each positive integer j such that d{xj, x) < 

If (M, d{x, y)) and (A^, p{u, v)) are metric spaces, then a mapping from 
M into N is said to be an isometry if 

(A.6) p{cj>{x),cj>{y))^d{x,y) 

for all x,y & M. Any metric space (M, d{x, y)) has a completion in the sense 
that there is a complete metric space (A", p{u, v)) and an isometry (p : M ^ N 
such that 

(A.7) 0(M) = {(Pix) -.xeM} 

is dense in A^. The completion is unique up to isomorphism, in the sense 
that if {N' , p'{z,w)) is another complete metric space and (p' is an isometric 
embedding of M into A"' whose image is dense in A"', then there is an isometry 
■0 from A^ onto A^' such that 

(A.8) = (p'ix) 

for all X G M. Basically one defines ip first on the image of M in A^ by this 
condition, and then shows that xp can be extended to an isometry from A^ 
onto A^'. 

If (M, d{x, I/)) is a metric space, then d{x, y) is said to be an ultrametric 

if 

(A. 9) d{x,w) < meix{d{x,y),d{y,w)) 

for all x,y,w G M, which is a stronger version of the triangle inequality. For 
instance, suppose that Fi,F2, . . ., is a sequence of finite sets, each with at 
least 2 elements, and let M be the set of sequences x = {xi}'^^ such that 
xi G Fi for each positive integer /. Let {pi}iZi be a sequence of positive 
real numbers which is strictly decreasing and converges to 0. li x — {xijiZi, 
y — {yi}i^i elements of M, then put d{x, y) — ii x — y and otherwise 
put d{x, y) — Pn, where n is the smallest positive integer such that x„ 7^ 
This defines an ultrametric on M. 
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B Compactness 



Let A be a set, and let {xj}'jLi be a sequence of points in A. By a subsequence 
of {xj}'jLi we mean a sequence of the form {a;^,}^]^, where {jijfli is a strictly 
increasing sequence of positive integers. In other words, we basically restrict 
the original sequence to an infinite subset of integer indices, arranged in 
increasing order. 

A sequence is automatically considered to be a subsequence of itself, and 
a subsequence of a subsequence of a sequence {xjjJLi is also a subsequence 
of {xj}'jLi. Observe that if {xj}'jLi is a sequence of points in A and if A is the 
union of finitely many subsets Ai, . . . , A^, then there is a g. 1 < q < n, and 
a subsequence {xji}'^i of {xjy'j^i such that Xji e Ag for all positive integers 
I. 

Let {M,d{x,y)) be a metric space. If {xj}^^ is a sequence of points in 
M which converges to a point x e M, then every subsequence {xji}iZi of 
{xj}"^^ converges to x. If {x^}"^^ is a Cauchy sequence in M, then every 
subsequence {xj^}'^^ of {xj}'^-^^ is also a Cauchy sequence. 

A subset £^ of M is said to be closed if for every sequence {xjj'jL^ of 
points in E which converges to a point x E M we have that x E M. This is 
equivalent to the requirement that if x is an element of M such that for each 
e > there is a y G M with d{x, y) < e, then y E E. The empty set and M 
itself are automatically closed subsets of M. 

If ^4 is a subset of M and p is a point in M, then that p is a limit point 
of A if for each r > there exists a point x E A such that d{p, x) < r and 
X ^ p. This is equivalent to saying that for each r > there are infinitely 
many elements of A whose distance to p is less than r. Equivalently, p is a 
limit point of A if there is a sequence of points in A, none of which are equal 
to p, and which converges to p. At any rate, a finite subset of M has no 
hmit points. A subset of M is closed if and only if it contains all of its limit 
points. 

For each p E M and nonnegative real number r, the set 
(B.l) {x E M : d{x,p) <r} 

is a closed subset of M. One can show this using the triangle inequality. 
The union of finitely many closed subsets of M is a closed subset of M. The 
intersection of any family of closed subsets of M is a closed subset of M. 

Let A be an arbitrary subset of M. The closure of A is denoted A and is 
defined to be the set of points x e M for which there is a sequence {xj}'^-^ 
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of points in A which converges to x. If x G A, then one can take Xj — x for 
all j, and thus we have that A C A. Equivalently, a point x E M lies in the 
closure of A if for every e > there is a point y E A such that d{x^ y) < e. 
This is also the same as saying that the closure of A is equal to the union of 
A and the set of hmit points of A. 

By definition, the closure of A is equal to A if and only if ^4 is a closed set . 
In general one can check that if A is any subset of M, then the closure of A is 
a closed subset of M. For if a; is a point in M which can be approximated by 
elements of A, then x can be approximated by elements of A, and therefore 
lies in A. 

A subset K of M is said to be compact if for every sequence {xj}^^ of 
points in K there is a subsequence {xj^}'^^ of {xj}'^-^^ which converges to a 
point in K. A subset K of M has the limit point property if every infinite 
subset E of K has a limit point which is an element of i^. It is not too 
difficult to show that compactness is equivalent to the limit point property. 
Namely, if K is compact and E is an infinite subset of K, one can choose a 
sequence of points in E in which no point occurs more than once, and any 
subsequential limit of this sequence is a limit point of E. Conversely, if K has 
the limit point property and {xj}'^!^ is a sequence of points in K, then either 
there is a subsequence of {xj}"^^ which is constant and hence convergent, or 
the set of x/s is infinite and one can check that a limit point of this set is 
also the limit of a subsequence of {xj}'^^. 

A compact subset of M is closed. For suppose that x is an element 
of M and {xjj'jLi is a sequence of points in K which converges to x. By 
compactness there is a subsequence {xjJ'iZi of {xjj^i which converges to 
a point in K. However, this subsequence converges to x, since the whole 
sequence converges to x, and it follows that x E K. 

U Ki, . . . , Kn are compact subsets of M, then the union 

(B.2) K ^ KiUK2U---UKn 

is also a compact subset of M. Indeed, if is a sequence of points in K, 

then there is a. q, 1 < q < n, and a subsequence {xj^}iZi such that Xj^ e Kg 
for all I. The compactness of Kg then implies that there is a subsequence of 
{xjiyiZi which converges to a point in Kg. This subsubsequence of {a;^}"?,^ is 
also a subsequence of {xj}'^!^, and it converges to a point in K, as desired. 

Suppose that E, K are subsets of M, with E C K, E closed, and K com- 
pact. Under these conditions E is also compact. For suppose that {xj}'^^ is 
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a sequence of points in E. Because K is compact, a subsequence of {a^^j}^^! 
converges to a point in K. Because E is closed, the limit of this subsequence 
lies in E. 

A subset of M is said to be bounded if there is a positive real number 
r such that d{x, y) <r for all x,y E E. Equivalently, E is bounded if there 
exists p E M and t > such that d{p, x) < t for all x E E. This is also 

equivalent to the condition that for each p E M there is a t{p) > so that 
d{p, x) < tip) for all x E M. Observe that the union of finitely many bounded 
sets is bounded. 

If £^ is a nonempty bounded subset of M, then the diameter of E is 
denoted diamE' and defined to be the supremum of the numbers d{x, y) for 
x,y E E. The closure of a bounded set is also bounded, and has the same 
diameter, assuming that it is not empty. The diameter of a union of two 
bounded subsets of M is less than or equal to the sum of the diameters of 
the two subsets if the two subsets have a point in common, and if the metric 
is an ultrametric, then the diameter of the union is less than or equal to the 
maximum of the diameters of the two subsets. 

A compact subset i^' of M is bounded. Indeed, let p be any element of M. 
If K is not bounded, then for each positive integer n there is a point Xn E K 
such that d{p, Xn) > n. In this case {xn}'^=i is a sequence of points in K 
for which there is no convergent subsequence, contradicting the assumption 
that K is compact. 

A subset of M is said to be totally bounded if for each e > there 
exist finitely many points pi, . . . ,pi E M so that for each x E K we have 
d{x,pj) < e for at least one j. Equivalently, E is totally bounded if it can be 
expressed as the union of finitely many subsets of arbitrarily small diameter. 
The closure of a totally bounded subset of M is also totally bounded. 

Compact subsets of M are totally bounded. Indeed, assume for the sake 
of a contradiction that i^T is a compact subset of M which is not totally 
bounded. In this case there is an e > and a sequence of points {xi}'^^ in 
K such that d{xi,Xm) > e when I < m. Clearly no subsequence of {xi\'j^i 
can converge. In fact, no subsequence of {xi}fl^ can be a Cauchy sequence. 

If is a subset of M which is totally bounded, and if {xi}fl^ is any 
sequence of points in E, then for each e > there is a subsequence of {a;;};^^ 
contained in a subset of E of diameter less than e. This is easy to see by 
expressing as a union of finitely many subsets each with diameter less than 
e. One can go a bit further and say that if is a totally bounded subset of 
M and {xi}'^-^ is a sequence of points in E, then {xi}"^-^ has a subsequence 
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which is a Cauchy sequence. This uses a Cantor diagonahzation argument. 

In short, a subset of M is totally bounded if and only if every sequence 
of points in E has a subsequence which is a Cauchy sequence. If M is a 
complete metric space, then a subset of M is compact if and only if it is 
closed and totally bounded, by the preceding observations. In any metric 
space, a Cauchy sequence with a convergent subsequence converges to the 
same limit as the subsequence, and thus a Cauchy sequence contained in a 
compact set converges. It follows that a metric space M is compact as a 
subset of itself if and only if it is complete and totally bounded. 

An interesting class of examples is provided by the spaces mentioned at 
the end of Appendix consisting of sequences with the Ith term in a fixed 
finite set Fi for each /. For these spaces one can verify completeness, total 
boundedness, and compactness quite concretely. A sequence of elements 
in one of these spaces is a sequence of sequence, and convergence basically 
amounts to convergence of the individual terms in the F;'s. 

C Topological spaces 

Let X be a set, and let r be a collection of subsets of X, called the open 
subsets of X. We say that r defines a topology on X, so that X becomes a 
topological space, if the empty set and X itself are open subsets of X, if the 
intersection of finitely many open subsets of X is again an open subset of X, 
and if the union of any family of open subsets of X is an open subset of X. 
The condition about unions is equivalent to saying that if is a subset of 
X and if for each p E W there is an open subset U of X such that p E U 
and U C W, then W is an open subset of X. 

Let {M,d{x,y)) be a metric space. If p E M and r > 0, then the open 
ball with center p and radius r is denoted B{p, r) and defined to be the set 
of X E M such that d{p, x) < r. The closed ball with center p and radius r 
is denoted B{p, r) and defined to be the set of x E M such that d{p, x) < r. 
A subset f/ of M is said to be open if for each p E U there is an r > such 
that B{p, r) C U. It is easy to check that this defines a topology on M and 
that every open ball in M is an open subset of M. 

If (X, r) is a topological space, then a subset of X is said to be closed 
if the complement X\E of E in X, consisting of the points in X which do 
not lie in E, is an open subset of X. It follows that the empty set and X 
itself are closed subsets of X, that the union of finitely many closed subsets 
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of X is a closed subset of X, and that the intersection of any family of closed 
subsets of X is a closed subset of X. In a metric space a subset is closed in 
the sense defined in Appendix El if and only if it is closed in the sense that 
its complement is open. 

If A is an arbitrary subset of X, then the closure of A is denoted A and 
defined to be the set of points p G X such that for each open subset f/ of X 
with p G f/ we have that the intersection of A and U is nonempty. Thus the 
closure of A contains A automatically, and in a metric space this definition 
of the closure is equivalent to the one in Appendix El One can check that 
A = A a and only if A is a closed subset of X, and that A always is a closed 
subset of X. 

If X, F are topological spaces and / is a mapping from XtoY, then we 
say that / is continuous at a point p G X if for each open subset W of Y 
such that f{p)EW there is an open subset ^7 of X such that p E U and 

(c.i) ucr\w). 

Recall that f~^{W) is by definition the set of points x G X such that f{x) G 
W. If / : X — > y is continuous at every point in X, then we say that / is a 
continuous mapping from X to Y. This is equivalent to saying that f^^iV) 
is an open subset of X for every open subset V of Y, or that f~^{E) is a 
closed subset of X for every closed subset EofY. 

Let X and Y be sets, and let / be a mapping from X to Y. We say that 
/ is injective or one-to-one if for each pair of points xi,X2 G X with xi ^ X2 
we have that f{xi) ^ f{x2)- We say that / maps X onto Y if for each y &Y 
there is an x G X such that f{x) = y. Thus / is a one-to-one mapping of X 
onto Y if and only if there is an inverse mapping h from y to X such that 

(C.2) h{f{x)) = X 

for all X G X and 

(C.3) f{h{y)) = y 

for all y eY . The inverse mapping is unique when it exists, and is denoted 

Let X, Y be topological spaces, and let / be a one-to-one mapping from 
X onto Y . We say that / is a homeomorphism if / is a continuous mapping 
from X to y and is a continuous mapping from Y to X. Equivalently, / 
is a homeomorphism if / sends open subsets of X to open subsets of Y and 
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sends open subsets of Y to open subsets of X. One could just as well 
use closed subsets instead of open subsets here. 

A subset of a topological space X is said to be compact if every open 
covering of K admits a finite subcovering. Recall that an open covering of a 
subset X of X is a family {Ua}aeA of open subsets of X whose union contains 
X as a subset. Thus K is compact if for each open covering {Ua}aGA of K 
there is a finite subset Ai of A such that K is contained in the union of the 
UaS with a E Ai. Finite subsets of X are automatically compact. 

If X, Y are topological spaces, / is a continuous mapping from X to Y, 
and X is a compact subset of X, then f{K) is a compact subset of Y, where 
f {K) is by definition the set of points in Y of the form f{x) for some x E K. 
Indeed, suppose that {V^jagA is an arbitrary open covering of f{K) in Y. 
Then {f^^{Va)}aeA is an open covering of in X. Because K is compact, 
there is a finite subset Ai of A such that K is contained in the union of 
f ^^{Va), Oi e Ai. This implies that f{K) is contained in the union of 14, 
a e Ai. 

A topological space X is said to satisfy the first axiom of separation if 
subsets of X with exactly one element are closed subsets. This is equivalent 
to saying that if x,y G X and x ^ y, then there is an open subset U of X 
such that X e U and y ^ U. If X satisfies the first axiom of separation, then 
every finite subset of X is a closed subset of X. A topological space X is said 
to satisfy the second axiom of separation if for every x,y E X with x ^ y 
there are open subsets U ^ V of X such that x E U, y E V , and [/ fl V = 0. 
One also calls X a HausdorfF topological space in this case, and of course the 
second axiom of separation implies the first axiom of separation. 

Let X be a Hausdorff topological space, let X be a compact subset of X, 
and fix a point x G X\K. For each y E K there are open subsets U{y), V{y) 
of X such that x G U{y), y G V{y), and U{y), V{y) are disjoint. Because K 
is compact there are finitely many points yi, . . . ,yn E K such that 

(C.4) KCV{yi)U---UV{yn). 

Thus 

(C.5) U{y,)n---U{yn) 

is an open subset of X which contains x and is contained in the complement 
of K. It follows that X is a closed subset of X. 

In order for compact subsets of a topological space X to be closed, it is 
obviously necessary for X to satisfy the first axiom of separation, since finite 
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subsets of X are compact. The first axiom of separation is not sufficient, 
as one can show by examples. For this reason the term "quasicompact" 
is sometimes used for compact subsets in topological spaces which are not 
necessarily Hausdorff. 

For instance, let X be the real line R together with an extra point 0'. Let 
us write R' for the set obtained from R by removing and adding 0'. One 
can define a topology on X so that R and R' are open subsets of X which 
are each homeomorphic to the real line with the standard topology. The 
resulting space X satisfies the first axiom of separation but not the second 
one, because if U, V are open subsets of X such that E U and 0' G V, then 
U and V have elements in common, namely nonzero real numbers close to 0. 
Closed and bounded intervals in the real line are compact, and they give rise 
to compact subsets of R and R' which are compact subsets of X, but they 
may not be closed because of containing and not 0' or vice-versa. 

In any topological space X, the union of two compact sets Ki, K2. For 
suppose that {Ua}a£A is an open covering of Ki U Then this is also an 
open covering of Ki, K2 individually, and thus there are finite subsets Ai, 
A2 of A such that Ki is contained in the union of the [/„'s with a E Ai and 
K2 is contained in the union of the UaS with a & A2. Therefore Ai U A2 is 
a finite subset of A such that Ki U K2 is contained in the union of the UaS 
with a e AiU A2. 

In any topological space X, if F is a closed subset of X and i^T is a 
compact subset of X, then the intersection of F and K is compact. Indeed, 
let an arbitrary open covering of F fl be given. Since F is closed, X\F is 
open, and we can add this open set to the open covering of F n to get an 
open covering of K. The compactness of K implies that finitely many open 
subsets in the original open covering of F (1 K together perhaps with X\F 
covers K. Hence these finitely many open subsets from the original covering 
oi FnK covers FnK. 

If (M, d{x, y)) is a metric space, then a subset i^' of M is compact in the 
sense of open coverings if and only if it is compact in the sense of Appendix iBl 
To see this, assume first that K is compact in the sense of open coverings, and 
let E be any infinite subset of K. If we assume for the sake of a contradiction 
that E does not have a limit point in then for each p & K there is 
then a positive real number r{p) such that E fl B{p,r{p)) is finite. Since K 
is compact in the sense of open coverings, K is contained in the union of 
B[p,r{p)) for finitely many p & K, which implies that E is finite. Notice 
also that compactness in the sense of open coverings immediately implies the 
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property of being totally bounded. 

Conversely, suppose that A' is a totally bounded subset of M and that 
every Cauchy sequence in K converges to a point in K. Suppose too that 
there is an open covering of K which does not admit any finite subcovering. 
Using the total boundedness of K one can show that there is a sequence 
Ki, K2, . . . ol closed subsets of K such that i^'n+i C for all n, the diameter 
of Kn is less than 1/n for all n, and no finite subcollection of open sets from 
our covering of K covers any Kn- Using the convergence of Cauchy sequences 
one can show that there is a point p E K such that p G Kn for all n. Hence 
p is contained in one of the open subsets in the covering of K, which then 
contains K^ for sufficiently large n, a contradiction. 
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